Applying Statistical Models for The Sample Data Production Process of Data Warehouse
نویسندگان
چکیده
For data warehouse and OLAP (On-Line Analytical Processing) systems various preparation tasks are necessary to qualify and improve the effectiveness of their usage. In order to put into operation such a task the creation of statistical sound sample data is indispensable. Applying statistical models for the generation of sample data promises a high-quality prospect which allows the production of sample data based on different usercontrollable statistical parameters. The sample data, which follows the defined statistical requirements, can be conveniently used for testing, benchmarking, demonstrating and training in the fields of data warehouse. In this paper, we address the activities of generation sample data; classify the generating methods and using BEDAWA [3] tool as illustration purposing to generate sample data. Furthermore, we discuss the uses of sample data in the field of data warehouse.
منابع مشابه
Improvement of the Analytical Queries Response Time in Real-Time Data Warehouse using Materialized Views Concatenation
A real-time data warehouse is a collection of recent and hierarchical data that is used for managers’ decision-making by creating online analytical queries. The volume of data collected from data sources and entered into the real-time data warehouse is constantly increasing. Moreover, as the volume of input data to the real time data warehouse increases, the interference between online loading ...
متن کاملارائه مدل تلفیقی برای ارزیابی آمادگی سازمان ها جهت پیاده سازی سیستم انباره داده با استفاده ازتحلیل سلسله مراتبی
Enterprise Data Warehouse initiative is a high investment project. The adoption of Data Warehouse will be significantly different depending upon the level of readiness of an organization. Before implementation of Data Warehouse system in a firm, it is necessary to evaluate the level of the readiness of firm. A successful Data Warehouse assessment model requires a deep understanding of opportuni...
متن کاملModeling and Simulation of Polyhydroxybutyrate Production by Protomonas extorquens in Fed-batch Culture
Modeling and simulation of Polyhydroxybutyrate (PHB) production by Protomonas extorquens in fed-batch culture were conducted in this research. The fed-batch model, developed for this process, employed a kinetic model proposed by other researchers. Several kinetic models were investigated to choose the best model. The criterion for this selection was goodness of fit (δ2). Haldane kinetic model w...
متن کاملModel Selection for Mixture Models Using Perfect Sample
We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...
متن کاملModeling Ghotour-Chai River’s Rainfall-Runoff process by Genetic Programming
Considering the importance of water and computing the amount of rainfall runoff resulted from precipitation in recent decades, using appropriate methods for predicting the amount of runoff from rainfall date has been really essential. Rainfall-runoff models are used to estimate runoff generated from precipitation in the catchment area. Rainfall-runoff process is totally a non-linear phenomenon....
متن کامل